Constrained Structural Maximum A for Average-Voice-Based
Abstract
This paper proposes a constrained structural maximum a posteriori linear regression (CSMAPLR) algorithm to further improve speaker adaptation performance in HMM-based speech synthesis. The algorithm applies the concept of structural maximum a posteriori (SMAP) adaptation to the estimation of the transformation matrices of constrained MLLR (CMLLR): the transformation matrices are estimated recursively under the MAP criterion, proceeding from the root node to the lower nodes of the context decision tree. We incorporate the algorithm into an HSMM-based speech synthesis system. Objective evaluation results show that CSMAPLR adaptation combines the advantages of both CMLLR and SMAPLR adaptation. Subjective evaluation results show that CSMAPLR adaptation yields synthetic speech more similar to the target speaker than either CMLLR or SMAPLR adaptation does.
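The root-to-leaf recursion described in the abstract can be illustrated with a toy sketch. This is not the paper's estimation procedure: the count-weighted interpolation below is a simplified stand-in for the actual MAP estimate, and the names `Node`, `propagate`, `count`, `w_ml`, and the prior weight `c` are all hypothetical. It only shows the structural idea that each node's transform serves as the prior for its children, so nodes with little adaptation data fall back on their parent.

```python
from dataclasses import dataclass, field
from typing import List, Optional

Matrix = List[List[float]]


@dataclass
class Node:
    """One node of a (hypothetical) context decision tree."""
    name: str
    count: float          # amount of adaptation data at this node (assumed statistic)
    w_ml: Matrix          # transform estimated from this node's data alone (assumed given)
    children: List["Node"] = field(default_factory=list)
    w_map: Optional[Matrix] = None


def interpolate(a: Matrix, b: Matrix, lam: float) -> Matrix:
    """Element-wise lam * a + (1 - lam) * b."""
    return [[lam * x + (1.0 - lam) * y for x, y in zip(ra, rb)]
            for ra, rb in zip(a, b)]


def propagate(node: Node, w_prior: Matrix, c: float = 10.0) -> None:
    """Walk the tree from the root down, using the parent's result as the prior.

    Simplification (assumption): the MAP estimate is replaced by a
    count-weighted interpolation between the node's own estimate and the prior.
    """
    lam = node.count / (node.count + c)
    node.w_map = interpolate(node.w_ml, w_prior, lam)
    for child in node.children:
        propagate(child, node.w_map, c)


def scaled_identity(scale: float) -> Matrix:
    """A 2x3 transform [A | b] with A = scale * I and b = 0."""
    return [[scale, 0.0, 0.0], [0.0, scale, 0.0]]


leaf_rich = Node("leaf_rich", count=1000.0, w_ml=scaled_identity(1.2))
leaf_poor = Node("leaf_poor", count=0.0, w_ml=scaled_identity(5.0))
root = Node("root", count=500.0, w_ml=scaled_identity(1.1),
            children=[leaf_rich, leaf_poor])

# The root uses the identity transform as its prior.
propagate(root, scaled_identity(1.0))
```

In this toy setup, `leaf_poor` has no data and therefore inherits the root's transform unchanged, while `leaf_rich` stays close to its own estimate.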
Similar resources
Comparing the Voice Handicap Index Scores in Groups with Structural and Functional Voice Disorders
Objective: The effects of voice disorders vary from person to person. Occupation, work environment, life, and family reaction are variables that affect one's perception of his/her own voice as impaired. Voice Handicap Index (VHI) has not yet been used to compare the degree of voice disorders. Assuming that the quality of life may be different under a variety of voice disorders and that diffe...
OPTIMAL CONSTRAINED DESIGN OF STEEL STRUCTURES BY DIFFERENTIAL EVOLUTIONARY ALGORITHMS
Structural optimization, when approached by conventional (gradient-based) minimization algorithms, presents several difficulties, mainly related to the computational cost of the huge number of nonlinear analyses required for both Objective Functions (OFs) and Constraints. Moreover, from the early '80s to today, Evolutionary Algorithms have been successfully developed and applied as a ...
Employees’ Organizational Voice: Investigating the Antecedents and their Structural Relations Using the ISM and Fuzzy MICMAC Method
This study aims to explore the structural relationships among the factors affecting employees' voice. To do so, a series of factors influencing the occurrence of organizational voice was identified by reviewing the literature, and then the opinions of 15 senior and middle managers and academic professors about the relationships between these factors were examined. Finally, data were anal...
Average of Fulfilling Patients' Expectations from Interactive Voice Response Appointment Systems And Websites in Selected Clinics in Isfahan
Aim: In this study, the average of patients' expectations of fulfillment from Interactive Voice Response (IVR) and clinics' websites to make appointments for patients in Isfahan has been investigated. Methods: The study is a cross-sectional survey. The research population was all patients referred to the clinic of Amin and Al-Zahra hospitals of Isfahan University of Medical ...
Combining Vocal Tract Length Normalization with Linear Transformations in a Bayesian Framework
Recent research has demonstrated the effectiveness of vocal tract length normalization (VTLN) as a rapid adaptation technique for statistical parametric speech synthesis. VTLN produces speech with naturalness preferable to that of MLLR-based adaptation techniques, being much closer in quality to that generated by the original average voice model. By contrast, with just a single parameter, VTLN c...